Ruminative Reinforcement Learning: Improve Intelligent Inventory Control by Ruminating on the Past
نویسنده
چکیده
Reinforcement Learning (RL) can solve practical sequential decision problems, even when structures of the problems are less understood. However, some sequential decision problems intrinsically have structural parts that are easily to formulate and distinguish from less understood parts. Exploiting this knowledge may help improve performance of RL. This study proposed and investigated an approach to exploit the knowledge of structural parts of inventory management problems in the context of RL. The proposed method is motivated by human behavior of ruminating on what has happened and what would happen if alternative choices would have been taken. Our investigation provides an insight into RL mechanism and our experimental results show viability of the approach.
منابع مشابه
Using BELBIC based optimal controller for omni-directional threewheel robots model identified by LOLIMOT
In this paper, an intelligent controller is applied to control omni-directional robots motion. First, the dynamics of the three wheel robots, as a nonlinear plant with considerable uncertainties, is identified using an efficient algorithm of training, named LoLiMoT. Then, an intelligent controller based on brain emotional learning algorithm is applied to the identified model. This emotional l...
متن کاملHierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents
This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...
متن کاملIntelligent Inventory Control: Is Bootstrapping Worth Implementing?
The common belief is that using Reinforcement Learning methods (RL) with bootstrapping gives better results than without. However, inclusion of bootstrapping increases the complexity of the RL implementation and requires significant effort. This study investigates whether inclusion of bootstrapping is worth the effort when applying RL to inventory problems. Specifically, we investigate bootstra...
متن کاملThe Effectiveness of Self-Efficacy Training on Promoting Mental Health and Decreasing Ruminating in Divorced Women with Drug-Dependent Husbands
Objective: The aim of this study was to evaluate the effectiveness of self-efficacy training on promoting mental health and reducing ruminating in divorced women with drug-dependent husbands supported by Kohgiluyeh and Boyer-Ahmad Welfare Organization. Method: The present study was quasi-experimental with pretest-posttest design with a control group. The statistical population included all divo...
متن کاملA new Evolutionary Reinforcement Scheme for Stochastic Learning Automata
A stochastic automaton can perform a finite number of actions in a random environment. When a specific action is performed, the environment responds by producing an environment output that is stochastically related to the action. The aim is to design an automaton, using an evolutionary reinforcement scheme (the basis of the learning process), that can determine the best action guided by past ac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- JCP
دوره 9 شماره
صفحات -
تاریخ انتشار 2014